Domain adaptation for robust automatic speech recognition in car environments
نویسندگان
چکیده
A major obstacle for the migration of automatic speech recognition into every-day life products is environmental robustness. Automatic speech recognition systems work reasonably well under clean (laboratory) conditions but degrade seriously under real world conditions (e.g. out-door, car). A lot of research work is devoted to increase the environmental robustness of automatic speech recognition systems. A common method is to use clean (office) data as a starting point and simulate the degraded environmental situation by additive artificial (e.g. Gaussian) or recorded noise from the real environment [1]. We study the validity of such additive noise experiments with regard to a real noisy environment. With regard to a previously published work on database adaptation we also examine the possible benefit when using models trained in the simulated environment as a starting point for adaptation ([2]). We present experimental results on data recorded for task-dependent whole word and phoneme modeling in the car environment on data from the the MoTiV Car Speech Data Collection (CSDC) [3].
منابع مشابه
Robust automatic speech recognition for accented Mandarin in car environments
This paper addresses the issues of robust automatic speech recognition (ASR) for accented Mandarin in car environments. A robust front-end is proposed, which adopts a Minimum Mean-Square Error (MMSE) estimator to suppress the background noise in frequency domain, and then implements spectrum smoothing both in time and frequency index to compensate those spectrum components distorted by the nois...
متن کاملStatistical Adaptation of Acoustic M for Robust Speech Re
Noise degrades the performance of Automatic Speech Recognition (ASR) systems working in real condition. The mismatch between the training and recognition conditions is considered the main factor involved in this degradation, and most methods for robust ASR are focussed on its minimization. In this work, we compare robust methods for ASR based on (a) the compensation of the noise effects and (b)...
متن کاملAuditory-based Acoustic Distinctive Features and Spectral Cues for Robust Automatic Speech Recognition in Low-SNR Car Environments
In this paper, a multi-stream paradigm is proposed to improve the performance of automatic speech recognition (ASR) systems in the presence of highly interfering car noise. It was found that combining the classical MFCCs with some auditory-based acoustic distinctive cues and the main formant frequencies of a speech signal using a multi-stream paradigm leads to an improvement in the recognition ...
متن کاملImproving the performance of MFCC for Persian robust speech recognition
The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but they are very sensitive to noise. In this paper to achieve a satisfactorily performance in Automatic Speech Recognition (ASR) applications we introduce a noise robust new set of MFCC vector estimated through following steps. First, spectral mean normalization is a pre-processing which applies to t...
متن کاملFrame-synchronous noise compensation for hands-free speech recognition in car environments - Vision, Image and Signal Processing, IEE Proceedings-
It has become increasingly important to develop hands-free speech recognition techniques for the human-computer interface in car environments. However, severe car noise degrades the speech recognition performance substantially. To compensate the performance loss, it is necessary to adapt the original speech hidden Markov models (HMMs) to meet changing car environments. A novel frame-synchronous...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999